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RAW SEQUENCE LISTING DATE: 08/13/2001 

PATENT APPLICATION: US/09/529,962 TIME: 09:51:12 



^10 



ENTERED 
ENTEP^^ 



Input Set : A:\06501-058001.TXT 
Output Set: N:\CRF3\08132001\I529962.raw 

4 <110> APPLICANT: Ota, Toshio 

5 Nishikawa, Tetsuo 

6 Salamov, Asaf 

7 Isogai, Takao 

9 <120> TITLE OF INVENTION: METHOD FOR SCREENING FULL-LENGTH C 
10 CLONES 

12 <130> FILE REFERENCE: 06501-058001 
14 <140> CURRENT APPLICATION NUMBER: 09/529,962 
C--> 15 <141> CURRENT FILING DATE: 2000-12-18 

17 <150> PRIOR APPLICATION NUMBER: JP 9/289982 

18 <151> PRIOR FILING DATE: 1997-10-22 

20 <150> PRIOR APPLICATION NUMBER: PCT/JP98/04772 

21 <151> PRIOR FILING DATE: 1998-10-21 
23 <160> NUMBER OF SEQ ID NOS : 18 

25 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

27 <210> SEQ ID NO: 1 

28 <211> LENGTH: 30 

29 <212> TYPE: RNA 

30 <213> ORGANISM: Artificial Sequence 

32 <220> FEATURE: y 

33 <223> OTHER INFORMATION: Oligo-capping linker sequence 

35 <4 00> SEQUENCE: 1 

36 agcaucgagu cggccuuguu ggccuacugg 30 

38 <210> SEQ ID NO: 2 

39 <211> LENGTH: 42 

40 <212> TYPE: DNA 

41 <213> ORGANISM: Artificial Sequence/ 

43 <220> FEATURE: 

44 <223> OTHER INFORMATION: Oligo(dT) adapter primer sequence 

46 <400> SEQUENCE: 2 

47 gcggctgaag acggcctatg tggccttttt tttttttttt tt 42 

49 <210> SEQ ID NO:-^ 

50 <211> LENGTH: 

51 <212> TYPE: DnV"""^ 

52 <213> ORGANISM: Artificial Sequence 

54 <220> FEATURE: 

55 <223> OTHER INFORMATION: Random adapter primer sequence 

57 <221> NAME/KEY: misc_feature 

58 •<222> LOCATION: (l)...(f32K) , 

59 <223> OTHER INFORMATIOth—n = A,T,C or G^Y^ 
61 <400> SEQUENCE: 3 

W--> 62 gcggctgaag acggcctatg tggccnnnnn nc 32 

64 <210> SEQ ID NQj_4 

65 <211> LENGTH: <^o) 

66 <212> TYPE: DNA^ 

67 <213> ORGANISM: Homo sapiens 
69 <220> FEATURE: 



,.„„^ 
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RAW SEQUENCE LISTING DATE: 08/13/2001 

PATENT APPLICATION: US/09/529,962 TIME: 09:51:12 

Input Set : A:\06501-058001.TXT 

Output Set: N:\CRF3\08132001\I529962.raw 

<221> NAME/KEY: misc_featpre 
<222> LOCATION: (l).../&^0/ 

<223> OTHER INFORMATIONV^n = A,T,C or G ^A/^ 
<400> SEQUENCE: 4 

atgcgcccgc gcggccctat aggcgcctcc tccgcccgcc gcccgggagc cgcagccgcc 60 
gccgccactg ccactcccgc tctctcagcg ccgccgtcgc caccgccacc gccactgcca 120 
ctaccaccgt ctgagtctgc agtccc.gaga tcccagccat catgtccata gagaagatct 180 
gggcccggga gatcctggac tcccgcggga accccacagt ggaggtggat ctctatactg 240 
ccaaaggtcc tttccgggct gcagtgccca gtggagcctc tacgggcatc tatgaggccc. 300 
tggagctgag ggatggagac aaacagcgtt acttaggcaa aggtgtcctg aaggcagtgg 3 60 

accacatcaa ctccaccatc gcgccagccc tcatcagctc aggtctctct gtggtggagc 420 
aagagaaact ggacaacctg atgctggagt tggatgggac tgagaacaaa tccaagtttg 480 
gggccaatcc atcctgggtg tgtctctggc cgtgtgtaag gcangggcaa ctgaacngga 540 
actgcccctg tatcgccaca ttgctcagct tggncgggaa ctcanacctc atcctgcctg 600 
ttgccggcct tcaacgtgat caatggttgg cttctcatgc ctggcaacaa anctggccat 660 
tgcnggaatt ttcatgatcc tccccnttgg gaaactgaaa aactttccgg aatgcccntc 720 
caactaagtt gcaaaaggtc taccnatacc ccccaagggg aattcctcca agggaacaaa 780 
tncccgggaa aggaatgccc cccaattntt ngggggaata aaaggtgggc tttgcccccc 840 
cattttcctg gaaaaaacna tnaaaaccct tgggaaactt 880 
<210> SEQ ID NO^r^ 
<211> LENGTH: (^4^ 
<212> TYPE: DNa" 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 

<221> NAME/KEY: misc_f eatore 

<222> LOCATION: (1) . , , (6^y . 
<223> OTHER INFORMATIONV-n = A,T,C or G 
<400> SEQUENCE: 5 

* tgtgcgttac ttacctcnac tcttagcttg tcggggacgg taaccgggac ccggtgtctg 60 
ctcctgtcgc cttcgcctcc taatccctag ccactatgcg tgagtgcatc tccatccacg 12 0 

ttggccaggc tggtgtccan attggcaatg cctgctggga gctctactgc ctggaacacg 180 
gcatccagcc cgatggccag atgccaagtg acaagaccat tgggggagga gatgactcct 240 
tcaacacctt cttcagtgag acgggcgctg gcaancacgt gccccgggct gtgtttgtag 300 
acttggaacc cacagtcatt gatgaagttc gcactggcac ctaccgccag ctcttccacc 360 
ctgagcagct catcncaggc aaggaagatg ctgccaataa ctatgcccga gggcactaca 4 20 

ccattggcaa ggagatcatt gaccttgtgt tggaccgaat tcgcaagctg gctgaccant 4 80 
gcaccggtct tcanggcttc ttggttttcc acagctttgg tgggggaact ggttctgggt 54 0 

tcacctccct gctcatggaa cgtctctcag ttgattatgg caagaaatcc aagctggagt 600 
tctccattta cccagcaccc cnggtttccn cngctgtant tngaa 64 5 

<210> SEQ ID N^:^^6 
<211> LENGTH: ^20 
<212> TYPE: DNJt^ 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 

<2 2 2> LOCATION: (1)...(?2^) ^ 
<223> OTHER INFORMATION^ = A,T,C or G^V 
<400> SEQUENCE: 6 

cttttttcgc aacgggtttg ccgccagaac acaggtgtcg tgaaaactac ccctaaaagc 60 
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126 caaaatggga aaggaaaaga ctcatatcaa cattgtcgtc attggacacg tagattcggg 120 

127 caagtccacc actactggcc atctgatcta taaatgcggt ggcatcgaca aaagaaccat 180 

128 tgaaaaattt gagaaggagg ctgctgagat gggaaagggc tccttcaagt atgcctgggt 240 

129 cttggataaa ctgaaagctg agcgtgaacg tggtatcacc attgatatct ccttgtggaa 300 

130 atttgagacc agcaagtact atgtgactat cattgatgcc ccaggacaca gagactttat 360 

131 caaaaacatg attacaggga catctcaggc tgactgtgct gtcctgattg ttgctgctgg 420 

132 tgttggtgaa tttgaagctg gtatctccaa gaatgggcag acccgagagc atgcccttct 480 
W--> 133 ggcttacaca ctgggtgtga aacaactaat tgtcggtgtt aacaaaatgg attcactgan 54 0 
W--> 134 ccaccctaca gccagaagaa atatgangaa attgttaagg aagtcagcac ttacattaag 600 
W--> 135 aaaattggct acaaccccga cacagtanca tttgtgccaa tttctggttg gaatggtgac 660 
W--> 136 aacatgctgg aaccaantgc taacatgcct tggttccagg gatggaaaat cccccnttaa 720 
W--> 137 ggatggcnat gccattggaa cccccctgct tgaaggctct ggantgcatc ctancaccaa 780 
W--> 138 ctccttcaaa ttgaaaaacc ccttgcnccc gcctccncca 820 

140 <210> SEQ ID NO:^ 

141 <211> LENGTH 

142 <212> TYPE 

143 <213> ORGANISM: Homo sapiens 

145 <220> FEATURE: 

146 <221> NAME/KEY: misc_fe^are 

147 <222> LOCATION: {1) . . . (fisy) 

148 <223> OTHER INFORMATIONS'^ = A,T,C or qO^ 

150 <400> SEQUENCE: 7 

151 gaggctgagg cagtggctcc ttgcacagca gctgcacgcg ccgtggctcc ggatctcttc 60 

152 gtctttgcag cgtagcccga gtcggtcagc gccggaggac ctcagcagcc atgtcgaagc 120 

153 cccatagtga agccgggact gccttcattc agacccagca gctgcacgca gccatggctg 180 

154 acacattcct ggagcacatg tgccgcctgg acattgattc accacccatc acagcccgga 240 

155 acactggcat catctgtacc attggcccag cttcccgatc agtggagacg ttgaaggaga 300 

156 tgattaagtc tggaatgaat gtggctcgtc tgaacttctc tcatggaact catgagtacc 360 

157 atgcggagac catcaagaat gtgcgcacag ccacggaaag ctttgcttct gaccccatcc 4 20 

158 tctaccggcc cgttgctgtg gctctagaca ctaaaggacc tgagatccga actgggctca 4 80 

159 tcaagggcag cggcactgca gaggtggagc tgaagaatgg agccactctc aaaatcacgc 540 

160 tggataatgc ctacatggaa aagtgtgacg agaacatcct gtggctggac tacaagaaca 600 
W--> 161 tctgcaaggt ggtggaagtg ggcaacaaga tctacgtgga tgatgggctn atttctctcc 660 
W--> 162 aggtgaacac aaaggtgccg acttcctggg tgacngangt ggaaaatggt ggctccttgg 720 
W--> 163 gcncaagaaa ggtgtgaact tcctggggct gctgtggant tgcctgctgt gtcngaaaaa 780 

164 gacatcca 788 

166 <210> SEQ ID NOi-^ 

167 <211> LENGTH: ^08/ 

168 <212> TYPE: DNA^ 

169 <213> ORGANISM: Homo sapiens 

171 <220> FEATURE: 

172 <221> NAME/KEY: misc_fepimre 

173 <222> LOCATION: (1) . . . /gOS)) Ay 

174 <223> OTHER INFORMATIONT^ = A;T,C or gO 

176 <400> SEQUENCE: 8 

177 acagcctggc tcctttgagt atgaatatgc catgcgctgg aaggcactca ttgagatgga 60 
W--> 178 gaagcagcag caggaccaag tggaccgcaa catcnaggag gctcgtgaga agctggagat 120 

179 ggagatggaa gctgcacgcc atgagcacca ggtcatgcta atgagacagg atttgatgag 180 
W--> 180 gcgccaagaa gaactticgga ggatggaaga gctgcacaac caagangtgc aaaaacgaaa 240 
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PATENT APPLICATION: US/09/529,962 TIME: 09:51:12 

Input Set : A:\06501-058001.TXT 
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gcaactggag ctcaggcagg aggaanagcg caggcgccgt gaagaanaga tgcggcggca 300 

gcaagaagaa atgatgcggc gacngcagga aggattcaag ggaaccttcc ctgatgcgag 360 

agagcaggag attcggatgg gtcngatggc tatgggaggt gctatgggca t:aaacnacag 4 20 

atgtgccatg ccccctgctc ctgtgccagc tggtacccca gctcctccag gacctgccac 4 80 

tattatgccg gatggaactt tgggattgac cccaccnaca actgaacgct ttggtcnggc 54 0 

tgctacnatg gaangaattg gggcaattgg tggaactcct cctgcattcn accgtgcagc 600 

tcctggga 608 

<210> SEQ ID NQ^-->9 

<211> LENGTH: ms) 

<212> TYPE: DNA""^^ 

<213> ORGANISM: Homo sapiens 

<220> FEATURE: 

<221> NAME/KEY: misc_f eajfepre 

<222> LOCATION: (l).../60l^ , 
<223> OTHER INFORMATIONTn = A,T,C or G 
<400> SEQUENCE: 9 

acagcctggc tcctttgagt atgaatatgc catgcgctgg aaggcactca ttgagatgga 60 

gaagcagcag caggaccaag tggaccgcaa catcnaggag gctcgtgaga agctggagat 120 

ggagatggaa gctgcacgcc atgagcacca ggtcatgcta atgagacagg atttgatgag 180 

gcgccaagaa gaacttcgga ggatggaaga gctgcacaac caagangtgc aaaaacgaaa 24 0 

gcaactggag ctcaggcagg aggaanagcg caggcgccgt gaagaanaga tgcggcggca 300 

gcaagaagaa atgatgcggc gacngcagga aggattcaag ggaaccttcc ctgatgcgag 360 

agagcaggag attcggatgg gtcngatggc tatgggaggt gctatgggca taaacnacag 420 

atgtgccatg ccccctgctc ctgtgccagc tggtacccca gctcctccag gacctgccac 4 80 

tattatgccg gatggaactt tgggattgac cccaccnaca actgaacgct ttggtcnggc 540 

tgctacnatg gaangaattg gggcaattgg tggaactcct cctgcattcn accgtgcagc 600 

tcctggga 608 
<210> SEQ ID NO^^O 
<211> LENGTH: /^Ip 
<212> TYPE: DNAr-^ 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 

<221> NAME/KEY: misc_fea^^e 
<222> LOCATION: (1) . . . ^813/) 
<223> OTHER INFORMATIOnV^ = A,T,C or G 
<400> SEQUENCE: 10 

gttgtggtat ctgtattaag aaatgcccct ttggcgcctt atcaattgtc aatctaccaa 60 

gcaacttgga aaaagaaacc acacatcgat attgtgccaa tgccttcaaa cttcacaggt 120 

tgcctatccc tcgtccaggt gaagttttgg gattagttgg aactaatggt attggaaagt 180 

caactgcttt aaaaatttta gcaggaaaac aaaagccaaa ccttggaaag tacgatgatc 240 

ctcctgactg gcaggagatt ttgacttatt tccgtggatc tgaattacaa aattacttta 300 

caaagattct agaagatgac ctaaaagcca tcatcaaacc tcaatatgta gaccagattc 360 

ctaaggctgc aaaggggaca gtgggatcta ttttggaccg aaaagatgaa acaaagacac 420 

aggcaattgt atgtcagcag cttgatttaa cccacctaaa agaacgaaat gttgaagatc 4 80 

tttcaggagg agagttgcag agatttgctt gtgctgtcgt ttgcatacag aaagctgata 540 

ttttcatgtt tgatgagcct tctagttacc tagatgtcaa gcagcgttta aaggctgcta 600 

ttactatacg atctctaata aatccagata gatatatcat tgtggtggaa catgatctaa 660 

gtgtattaga ctatctctcc gacttcatct gctgtttata tggtgtacca agcgcctatg 720 

gaattgtcac tatgcctttt agtgttagaa aaggcataaa cnttttttgg atgggtatgt 780 
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W--> 236 tccaacagaa aacttganaa tcnnaaatgc ntc 813 

238 <210> SEQ ID NC 

239 <211> LENGTH: ^ 

240 <212> TYPE: DNA 

241 <213> ORGANISM: Homo sapiens 
24 3 <220> FEATURE: 

244 <221> NAME/KEY: niisc_f eatw^e 

245 <222> LOCATION: (l)...(p5/ 

246 <223> OTHER INFORMATIOnV-Ii = A,T,C or G 

248 <400> SEQUENCE: 11 

249 agactctcac cgcagcggcc aggaacgcca gccgttcacg cgttcggtcc tccttggctg 60 

250 actcaccgcc ctcgccgccg caccatggac gcccccaggc aggtggtcaa ctttgggcct 120 

251 ggtcccgcca agctgccgca ctcagtgttg ttagagatac aaaaggaatt attagactac 180 

252 aaaggagttg gcattagtgt tcttgaaatg agtcacaggt catcagattt tgccaagatt 240 

253 attaacaata cagagaatct tgtgcgggaa ttgctagctg ttccagacaa ctataaggtg 300 

254 atttttctgc aaggaggtgg gtgcggccag ttcagtgctg tccccttaaa cctcattggc 3 60 

255 ttgaaagcag gaaggtgtgc ggactatgtg gtgacaggag cttggtcagc taaggccgca 420 

256 gaagaagcca agaagtttgg gactataaat atcgttcacc ctaaacttgg gagttataca 480 
W--> 257 aaaattccag atccaagcac ctggaacctc aacccanatg cctcctacgt gttttattgc 540 
W--> 258 ncaaatgaaa cggtgcatgg tgttganttt gactttatac ccnatgtcaa gggaacanta 600 
W--> 259 ctggtttgtg acattttcct ccaacttcct gtccaancca attgnatgtt tccaa 655 

261 <210> SEQ ID NOl 12 

262 <211> LENGTH: ^9 

263 <212> TYPE: DNA"^ 

264 <213> ORGANISM: Homo sapiens 

266 <220> FEATURE: 

267 <221> NAME/KEY: misc_f eafture 



268 <222> LOCATION: (1)...(59^) 



269 <223> OTHER INFORMATIONT^ = A,T,C or G 0 

271 <4 00> SEQUENCE: 12 

272 aaagatgcgc aggcgccgtg tggcactcgg cggtcgaaag gggagttcaa ggagacgggg 60 

273 gcgacgcggc tgagggcttc tcgtcggggt cggggctgca gccgtcatgc cggggatagt 120 

274 ggagctgccc actctagagg agctgaaagt agatgaggtg aaaattagtt ctgctgtgct 180 

275 taaagctgcg gcccatcact atggagctca atgtgataag cccaacaagg aatttatgct 240 
W--> 276 ctgccgctgg gaanagaaag atccgaggcg gtgcttagag gaaggcaaac tggtcaacaa 300 

277 gtgtgctttg gacttcttta ggcagataaa acgtcactgt gcagagcctt ttacagaata 360 

278 ttggacttgc attgattata ctggccagca gttatttcgt cactgtcgca aacagcaggc 420 
W--> 279 aaagtttgac nagtgtgtgc tggacaaact gggctgggtg cggcctgacc tgggaaaact 4 80 
W--> 280 gtcaaaggtc accaaagtga aaacagatcn acctttaccg ganaatccct atcactcaag 540 
W--> 281 aacaagaacg gatcccagcc ctganatcna aggaaatctg cancctgcca cacatggca 599 

283 <210> SEQ ID 




284 <211> LENGTH 

285 <212> TYPE: 
2 86 <213> ORGANISM: Homo sapiens 

288 <220> FEATURE: 

289 <221> NAME/KEY: misc_fe^tife 

290 <222> LOCATION: 



(1).. .((597/) 

291 <223> OTHER INFORMATrONr-ft = A,T,C or G ^ 
293 <400> SEQUENCE: 13 
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Input Set : A:\06501-058001.TXT 
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L:15 M:271 C 
L:62 M:341 W 
L:83 M:341 W 
L:84 M:341 W 
L:85 M:341 W 
L:86 M:341 W 
L:87 M:341 W 
L:88 M:341 W 
L:89 M:341 W 
L:102 M:341 W 
L:104 M:341 W 
L:106 M:341 W 
L:108 M:341 W 
L:109 M:341 W 
L:110 M:341 W 
L:112 M:341 W 
L:133 M:341 W 
L:134 M:341 W 
L:135 M:341 W 
L:136 M:341 W 
L:137 M:341 W 
L:138 M:341 W 
L:161 M:341 W 
L:162 M:341 W 
L:163 M:341 W 
L:178 M:341 W 
L:180 M:341 W 
L:181 M:341 W 
L:182 M:341 W 
L:183 M:341 W 
L:185 M:341 W 
L:186 M:341 W 
L:201 M:341 W 
L:203 M:341 W 
L:204 M:341 W 
L:205 M:341 W 
L:206 M:341 W 
L:208 M:341 W 
L:209 M:341 W 
L:235 M:341 W 
L:236 M:341 W 
L:257 M:341 W 
L:258 M:341 W 
L:259 M:341 W 
L:276 M:341 W 
L:279 M:341 W 
L:280 M:341 W 
L:281 M:341 W 



Current Filing Date differs. Replaced Current Filing Date 
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VERIFICATION SUMMARY DATE: 08/13/2001 

PATENT APPLICATION: US/09/529,962 TIME: 09:51:13 

Input Set : A: \06501-058001 , TXT 

Output Set: N:\CRF3\0813200I\I529962.raw 

L:302 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 
L:303 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 
L:319 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 14 
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